high-level decision
Agent Guide: A Simple Agent Behavioral Watermarking Framework
Huang, Kaibo, Zhang, Zipei, Yang, Zhongliang, Zhou, Linna
The increasing deployment of intelligent agents in digital ecosystems, such as social media platforms, has raised significant concerns about traceability and accountability, particularly in cybersecurity and digital content protection. Traditional large language model (LLM) watermarking techniques, which rely on token-level manipulations, are ill-suited for agents due to the challenges of behavior tokenization and information loss during behavior-to-action translation. To address these issues, we propose Agent Guide, a novel behavioral watermarking framework that embeds watermarks by guiding the agent's high-level decisions (behavior) through probability biases, while preserving the naturalness of specific executions (action). Our approach decouples agent behavior into two levels, behavior (e.g., choosing to bookmark) and action (e.g., bookmarking with specific tags), and applies watermark-guided biases to the behavior probability distribution. We employ a z-statistic-based statistical analysis to detect the watermark, ensuring reliable extraction over multiple rounds. Experiments in a social media scenario with diverse agent profiles demonstrate that Agent Guide achieves effective watermark detection with a low false positive rate. Our framework provides a practical and robust solution for agent watermarking, with applications in identifying malicious agents and protecting proprietary agent systems.
- Information Technology > Security & Privacy (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
Huang, Sili, Hu, Jifeng, Chen, Hechang, Sun, Lichao, Yang, Bo
In-context learning is a promising approach for offline reinforcement learning (RL) to handle online tasks, which can be achieved by providing task prompts. Recent works demonstrated that in-context RL could emerge with self-improvement in a trial-and-error manner when treating RL tasks as an across-episodic sequential prediction problem. Despite the self-improvement not requiring gradient updates, current works still suffer from high computational costs when the across-episodic sequence increases with task horizons. To this end, we propose an In-context Decision Transformer (IDT) to achieve self-improvement in a high-level trial-and-error manner. Specifically, IDT is inspired by the efficient hierarchical structure of human decision-making and thus reconstructs the sequence to consist of high-level decisions instead of low-level actions that interact with environments. As one high-level decision can guide multi-step low-level actions, IDT naturally avoids excessively long sequences and solves online tasks more efficiently. Experimental results show that IDT achieves state-of-the-art in long-horizon tasks over current in-context RL methods. In particular, the online evaluation time of our IDT is \textbf{36$\times$} times faster than baselines in the D4RL benchmark and \textbf{27$\times$} times faster in the Grid World benchmark.
- Europe > Austria > Vienna (0.14)
- Asia > China (0.04)
- North America > United States > Pennsylvania > Northampton County > Bethlehem (0.04)
A hierarchical control framework for autonomous decision-making systems: Integrating HMDP and MPC
Wang, Xue-Fang, Jiang, Jingjing, Chen, Wen-Hua
This paper proposes a comprehensive hierarchical control framework for autonomous decision-making arising in robotics and autonomous systems. In a typical hierarchical control architecture, high-level decision making is often characterised by discrete state and decision/control sets. However, a rational decision is usually affected by not only the discrete states of the autonomous system, but also the underlying continuous dynamics even the evolution of its operational environment. This paper proposes a holistic and comprehensive design process and framework for this type of challenging problems, from new modelling and design problem formulation to control design and stability analysis. It addresses the intricate interplay between traditional continuous systems dynamics utilized at the low levels for control design and discrete Markov decision processes (MDP) for facilitating high-level decision making. We model the decision making system in complex environments as a hybrid system consisting of a controlled MDP and autonomous (i.e. uncontrolled) continuous dynamics. Consequently, the new formulation is called as hybrid Markov decision process (HMDP). The design problem is formulated with a focus on ensuring both safety and optimality while taking into account the influence of both the discrete and continuous state variables of different levels. With the help of the model predictive control (MPC) concept, a decision maker design scheme is proposed for the proposed hybrid decision making model. By carefully designing key ingredients involved in this scheme, it is shown that the recursive feasibility and stability of the proposed autonomous decision making scheme are guaranteed. The proposed framework is applied to develop an autonomous lane changing system for intelligent vehicles.
- Europe > United Kingdom > England > Leicestershire (0.14)
- Europe > Netherlands (0.14)
- Automobiles & Trucks (1.00)
- Transportation > Ground > Road (0.68)
- Energy > Oil & Gas > Downstream (0.62)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.70)
Learning Based High-Level Decision Making for Abortable Overtaking in Autonomous Vehicles
Malayjerdi, Ehsan, Alcan, Gokhan, Kargar, Eshagh, Darweesh, Hatem, Sell, Raivo, Kyrki, Ville
Autonomous vehicles are a growing technology that aims to enhance safety, accessibility, efficiency, and convenience through autonomous maneuvers ranging from lane change to overtaking. Overtaking is one of the most challenging maneuvers for autonomous vehicles, and current techniques for autonomous overtaking are limited to simple situations. This paper studies how to increase safety in autonomous overtaking by allowing the maneuver to be aborted. We propose a decision-making process based on a deep Q-Network to determine if and when the overtaking maneuver needs to be aborted. The proposed algorithm is empirically evaluated in simulation with varying traffic situations, indicating that the proposed method improves safety during overtaking maneuvers. Furthermore, the approach is demonstrated in real-world experiments using the autonomous shuttle iseAuto.
- Europe > Estonia > Harju County > Tallinn (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > Idaho > Ada County > Boise (0.04)
- (3 more...)
- Transportation > Ground > Road (1.00)
- Automobiles & Trucks (1.00)
Like It Or Not, Artificial Intelligence Is Coming To Every Part Of Retail
Artificial intelligence will be important at every level of retail. When I talk to retailers about artificial intelligence, their eyes glaze over, like I'm speaking a foreign language and very few people want to talk about it. AI is going to pervade almost every aspect of retail, big and small. Here's a case in point: The EPA estimates that a supermarket of 50,000 square feet, that's a large store but not excessively so, uses about $200,000 worth of electricity and natural gas in the course of a year. According to the EPA, about half of that cost is in refrigeration and lighting.
- Retail (0.60)
- Energy (0.50)
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.38)
Skepticism Abounds For Artificial Intelligence In High-Level Decisions
Many decision-makers are skeptical about AI. What types would brush aside AI in favor of their own conclusions? When it comes to high-level strategic decisions, many executives still will go with their gut, and not the machine. Is this a good thing? AI is starting to play a key part in many things: customer personalization, sales recommendations, financial portfolio recommendations, aircraft collision avoidance, semi-autonomous vehicles, and medical screening.
What makes a good conversation?
This blog post is about the NAACL 2019 paper What makes a good conversation? How controllable attributes affect human judgments by Abigail See, Stephen Roller, Douwe Kiela and Jason Weston. On the left are tasks like Machine Translation (MT), which are less open-ended (i.e. Given the close correspondence between input and output, these tasks can be accomplished mostly (but not entirely) by decisions at the word/phrase level. On the right are tasks like Story Generation and Chitchat Dialogue, which are more open-ended (i.e. For these tasks, the ability to make high-level decisions (e.g.
Dynamic Input for Deep Reinforcement Learning in Autonomous Driving
Huegle, Maria, Kalweit, Gabriel, Mirchevska, Branka, Werling, Moritz, Boedecker, Joschka
In many real-world decision making problems, reaching an optimal decision requires taking into account a variable number of objects around the agent. Autonomous driving is a domain in which this is especially relevant, since the number of cars surrounding the agent varies considerably over time and affects the optimal action to be taken. Classical methods that process object lists can deal with this requirement. However, to take advantage of recent high-performing methods based on deep reinforcement learning in modular pipelines, special architectures are necessary. For these, a number of options exist, but a thorough comparison of the different possibilities is missing. In this paper, we elaborate limitations of fully-connected neural networks and other established approaches like convolutional and recurrent neural networks in the context of reinforcement learning problems that have to deal with variable sized inputs. We employ the structure of Deep Sets in off-policy reinforcement learning for high-level decision making, highlight their capabilities to alleviate these limitations, and show that Deep Sets not only yield the best overall performance but also offer better generalization to unseen situations than the other approaches.